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DECISION RULES FOR UNBIASED INVENTORY ESTIMATES 


P. Argentiero 
D. Koch 

ABSTRACT 

This paper presents an efficient and accurate procedure for estimating inven- 
tories from remote sensing scenes. In place of the conventional and expensive 
full dimensional Bayes decision rule, a onc-dimensionai feature extraction and 
classification technique is employed. It is shown that this efficient decision rule 
can be used to develop unbiased inventory estimates and that for large sample 
sizes typical of satellite derived remote sensing scenes, resulting accuracies are 
comparable or superior to more expensive alternative procedures. Mathematical 
details of the procedure are provided in the body of the report and in the 
Appendix. 

Results of a numerical simulation of the technique using statistics obtained 
from an observed LANDS AT scene are included. The simulation demonstrates 
the effectiveness of the technique in computing accurate inventory estimates. 
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DECISION RULES FOR UNBIASED INVENTORY ESTIMATES 


1. INTRODUCTION 

A satellite based remote sensing scene is usually composed of a large number of multidimen- 
sional samples each of which consists of radiation intensities obtained from a specific ground area. 
For the purposes of interpretation and information extraction, it is frequently useful to assume 
that the portion of the Earth’s surface under observation in a scene includes a small number of 
identifiable classes of ground cover. To interpret the scene a decision rule is required which 
allocates the samples among the classes. For many applications, it is important to know how 
each class of ground cover is distributed throughout a scene. When this type of information is 
conveyed by means of a color coded display it is called a thematic map. When thematic mapping 
is needed it is essential for a decision rule to correctly classify as many samples as possible. It 
is known that under a reasonable set of assumptions a Bayes or maximum likelihood classifier 
minimizes the global error rate. Hence, this continues to be the most commonly used classifica- 
tion procedure for applications which require thematic mapping. 

Some applications call for only an estimate of the proportion of a scene accounted for by 
each class of ground cover. This is referred to as inventory estimation. The information needed 
for inventory estimation is different in character from what is needed for thematic mapping. For 
instance, when only the proportions of a scene allocated to each class are estimated, a certain 
amount of cancellation of errors of commission and omission is possible. In the case of thematic 
mapping no such cancellation can occur. Hence, it should not surprise that the minimal error 
rate criterion might not be the most appropriate basis for choosing a decision rule for inventory 
estimation. 
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This paper presents a procedure for inventory estimation which is both efficient and accurate, 
In place of the conventional and expensive Bayes decision rule, a one dimensional feature extractic 
and classification technique is employed. It is shown that this computationally inexpensive dccisio 
rule can be used to develop unbiased inventory estimates and that for the large sample sizes typical 
of satellite derived remote sensing scenes, resulting accuracies are comparable to those of any alter- 
native procedure. Section 2 and the Appendix provide the mathematical details of this approach to 
inventory estimation. Section 3 shows tiie results of a numerical simulation of the technique using 
class statistics obtained from an observed LANDSAT scene. Section 4 provides a summary of the 
results. 
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2. INVENTORY ESTIMATION 


Inventory estimates can be obtained from a remote sensing scene by applying a decision rule to 
each sample and then determining by a simple counting procedure the proportions of the total 
sample set allocated to each class, The quality of the resulting estimates will depend on the choice 
of a decision rule. In order to choose intelligently it is useful to analyze the simple relationship 
which exists between the statistical properties of a decision rule and the expected values of errors in 
inventory estimates. 

Assume that a total of N samples arc to be allocated among K classes. Assume further that a 
decision rule lias been chosen and associate with, this decision rule a confusion matrix C defined as 

C(i,j)-* conditional probability that a sample from 
class j is assigned to class i. 


In addition, define the following symbols: 

Nj number of samples from class i included in 
the total sample set. 

A 

Nj -» number of samples estimated to be in class : 
by applying the decision rule and counting 
the number of samples allocated to class i. 

Ujj -> actual number of samples from class j that 
are assigned by the decision rule to class i 


Our first relationship is 


A 




"hi 


( 1 ) 


The symbol n. ; represents a binomial random variable witli mean and variance given by 

J 


E(n u ) = NjC(iJ) 


(2a) 
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E([E(n jfJ ) — n i j ] 2 ) = NjC (i, j) [ 1 - C (i, j) ] 


(2b) 


Hence 


Define 


Equation 3 yields 


E(Nj). £ NjC(iJ) 

j=i J 


N, 



(3) 


E(P|) = ^ Pj C (i, j) ((4) 

At tliis point it is useful to introduce matrix notation. Define K dimensional column vectors P and 
Pas 



Pi" 


— A — 1 
? 

p -v 

P2 

9 

* 

A 

, P - 

P2 

• 

• 
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The symbol Prepresents a vector of correct proportions of the scene accounted for by each of the 

A. 

K classes. The symbol P represents the proportions estimated by applying the decision rule and 
counting the number of samples allocated to each of the K classes. The information included in 
equation 4 can be expressed in matrix form as 


CP = E CP) 


(5) 
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Equation 5 implies that inventory estimates obtained from the application of a gi 
rule are unbiased if and only if the associated confusion matrix is the identity matrix. 

Assume that the confusion matrix C is invertible and define a vector P_ of modifie 
estimates as 

P_ = C" 1 P 

It follows that P. is an unbiased estimator of P. Let D represent the covariance matrix ■ 

E([P_-P] [P_-P] T ) = cr 1 Deer 1 ) 1, 

From the definition of P and equations I and 2b, one can show that the ith diagor 
of D is 

1 K-l 

D(i,i) ^ Ns C (i,j) (1 — C(ij)) 

fyt i“l 1 

Since 

ccij)(i ~c(u))<y4 


it follows that an upper bound on a diagonal element of D is 


D(i,i) 


4N 


We have shown that a sufficient condition for the derivation of unbiased inventory estin 
use of a decision rule whose confusion matrix is computable and invertible. The varianc 
ing unbiased inventory estimates arc bounded by terms which are proportional to ’AN wl 
total sample size. Hence for large sample sizes typical of satellite derived remote sensing 
inventory estimates given by the right side of equation 6 should be very accurate. 
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Based on the results of this section, we can specify the following criteria for choosing n deci- 
sion rule for inventory estimation 

A) computational efficiency 

B) the computability of the associated confusion matrix 

C) the invertibility of the associated confusion matrix. 

With regard to criterion B it is worth mentioning that there is no available practical algorithm for 
computing the confusion matrix of a multidimensional Bayes classifier. For this reason as well as 
for reasons of computational efficiency, the conventional Bayes or maximum likelihood decision rule 
is not an optimal choice for use in inventory estimation. 

In order to satisfy criterion A, our search for an appropriate decision rule was limited to those 
which employ dimensionality reduction. A simple example of dimensionality reduction is the selec- 
tion of certain subcomponents of the sample vectors for use in classification. This is referred to as 
feature selection with the components as features. On a more sophisticated level, one can introduce 
a priori information or specific problem knowledge to determine relationships among sample vector 
components. These relationships implicitly define a dimensionality reducing transformation. This 
procedure is called feature extraction . But since useful problem knowledge is not always available, 
the feature extraction approach preferred for our application depends entirely on training sample 
statistics. With this approach one attempts to find the dimensionality reducing transformation con- 
ditioned on training sample statistics which optimizes a performance measure. The advantages of 
this type of feature extraction when applied to remote sensing data are its ability to be implemented 
in an autonomous or non-interactive fashion, and its interdisciplinary character. 

In the case of a one dimensional Bayes decision rule it is possible to compute the associated 
confusion matrix. Details of the computation are provided in the Appendix. Hence, in order to 
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satisfy criterion B attention ean be restricted to linear one dimensional feature extraction and Bayes 
classification procedures. This approach to inventory estimation can be outlined as follows: 

A) linearly map ail samples onto one dimension. 

B) obtain one dimensional a priori class statistics from training sample statistics by applying 
the usual laws for the behavior of first and second order moments under linear trans- 
formations. 

C) employ a normality assumption and allocate the one dimensional samples among classes 
using the conventional Bayes or maximum likelihood decision rule. 

D) construct a vector of inventory estimates by determining the proportion of the total 
sample set allocated to each class 

E) use the algorithm described m the Appendix to compute the confusion matrix of the one 
dimensional Bayes decision rule. 

F) prcimiltiply the vector of inventory estimates by the inverse of the confusion matrix to 
obtain unbiased estimates. 

If the procedure outlined in steps A through F ..ere to begin with a one dimensional linear 
transformation chosen more or less at random the probable result would be failure at step F because 
the confusion matrix in question is apt to be singular. Hence, to satisfy criterion C a computationally 
efficient procedure is required for choosing the coefficients of a one dimensional linear transforma- 
tion such that tiie resulting confusion matrix used in step F is invertible. Since the elements of a 
confusion matrix arc positive and the columns must sum to one, a logical candidate is the feature 
extraction whose associated confusion matrix has maximum trace. Notice that this choice also lias 
the property that among all possible linear one dimensional feature extractions, it provides the mini- 
mal error rate when a priori class probabilities are equal. In this sense, it is the optimal one dimen- 
sional feature extraction. The coefficients of the optimal one dimensional feature extraction can be 
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obtained by means of a conventional multidimensional optimization procedure which is outlined in 
the Appendix, This feature extraction is recommended, for use in the inventory estimation process 
described in steps A through F and is used in the numerical example of Section 3. 
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3. A NUMERICAL SIMULATION 


The feature extraction and classification scheme described in Section 2 and the Appendix 
was incorporated into a computer program on the HP9825A programmable desk calculator. The 
program is in the HP basic language and occupies about 4000 bytes of the available memory. The 
input to the program consists of four dimensional mean vector and covariance matrices, and con- 
vergence control parameters for the Newton-Raphson optimization procedure. The output of 
the program consists of a four dimensional vector V which defines the one dimensional feature 
extraction, the means and variances of the resulting one dimensional classes, and the confusion 
matrix. 

For this numerical simulation class statistics were obtained from a LANDSAT 2 scene obtained 
over Finney County, Kansas during May of 1 975 [ 1 ] . The Five classes consisted of two types of 
winter wheat and litre. . in fuser crops. The class statistics were obtained from well known sites in 
Finney County. T? a * four channels are those of the Multispcctral Scanner on board the LANDSAT 2. 
The .,izcs of the training sample sets range from about one hundred to about three hundred. The 
class statistics are shown in Tabic I . Among the several factors which limit the accuracy of a Bayes 
decision rule in classifying remote sensing data are 

A) Significant deviations of class populations from normality 

B) Errors introduced by small sample sizes 

C) Training samples are not randomly selected from the populations in question and, thus, are 
not representative. 

Because this simulation ignores such factors, misclassification probabilities will be somewhat opti- 
mistic compared to what might be obtained in an actual application. To measure the quality of our 
feature extraction and classification procedure it was decided to compare its performance to that of 
a Bayes decision rule operating on all four channels of the Multispcctral Scanner data. This decision 
rule is optimal in the sense that it minimizes global misclassification probability. The performance 
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Tabic I 

Statistics o( 1 ANDSAT-2 MSS Signatures Acquired May 1975 Over 
Finney County, Kansas 


( I ) 184 Pixels of Non-Wheat 


Channel Mean Std. Dev. 

Covariance Matrix 


1 27.7 3.6 

2 24.5 8.0 

3 75 1 20.4 

4 37.4 12.0 

25.0 63.4 Kl£ TR, c 

-51.4 -140.7 415.5 

-30.8 -84.2 242.1 143.4 

(2) 333 Pixels of Non-Wheat 

1 34.7 3.6 

2 40.4 5.5 

3 47.0 5.2 

4 19.7 2.5 

12.7 

17.2 30.0 

8.8 9.9 27.3 

0.6 -1.2 10.4 6.0 

(3) 324 Pixels of Non-Wheat 

1 33.3 1.6 

2 38.5 2.7 

3 44.1 6.4 

4 18.7 3.3 

2.6 

2.6 7.2 

4.3 2.5 41.2 

1.9 0.3 19.9 11.1 

(4) 106 Pixels of Winte: .Vheai 

1 28.5 2.4 

2 27.5 4.0 

3 51.2 5.2 

4 24.0 3.0 

5.8 

7.4 16.2 

-6.0 -14.4 26.7 

-4.3 -8.9 14.1 9.0 

(5) 1 27 Pixels of W'inter Wheat 

1 21.5 2.7 

2 16.7 4.2 

“ 54.9 5.1 

4 29.1 2.8 

7.3 

10.3 18.0 

4.1 4.9 26.0 

-1.0 -2.8 11.4 8.1 
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of flic four dimensional Bayes classifier was determined by means ol a monte carlo program written 
for the HP*>825A programmable desk calculator. The assumptions of the monte carlo simulation 
were 

A. The five class populations are normally distributed 

It. The class statistics are those of Table I 

(*. A priori class probabilities are equal 

I). Monte carlo sample sues are sufficiently large that sampling error is insignificant. 

One thousand samples were obtained from each class and classified into one ol the five classes 
according to a four dimensional Bayes decision rule as described in the Appendix. The element in th 
tth row and the ith column of the confusion matrix was estimated as the proportion of samples cho- 
sen from the jth class which were assigned to the ith class. The estimated confusion matrix is shown 
in fable 2. flic table also shows the expected values of inventory estimates as obtained from the 
four dimensional Bayes decision rule. In this case expected values of inventory estimates can be ob- 
tained by averaging the rows of the confusion matrix. From Tabic 2 it is also seen that the com- 
bined miscljssificalion probability for classes 4 and 5 which are associated with winter wheal is 0.1. 


Table 2 

Confusion Matrix for Four Dimensional Bayes Classifier 


Class 

1 


3 

4 

IT, 

1 

0.81 

0.01 



o 

b 

u 





0.03 

0 

3 


Pj • 

0.84 

0.03 

0 

4 

1 

0.07 

0.03 

0.88 

0.06 

5 

0.02 

0 

0 


0.02 

expected 
Values of 
Inventory 
1 stimatc 

0.17 

0.15 

0.26 

0.22 

0.20 




















The statistic* shown in Table I were used as input to the one dimensional feature extraction 
and classification program, l or this simulation a / nior! class probabilities were assumed to he 
equal. Hence the correct value for each class inventory estimate is 0.2. The resulting confusion 
matrix along with expected values of inventory estimates are shown in Table 3. The confusion 
matrix was computed analytically according to the methods described in the Appendix. The 
combined misclassification probability for classes 4 ind 5 is 0.14. However, expected values of 
inventory estimates lor winter wheat classes 4 and 5 are seen to be considerably worse than cor- 
responding estimate lor the 'ptunal four dimensional Bayes classifier. The primary reason is 
errors of commission introduced by m ^classification ol samples l*om class I into classes 4 and 5. 
But section 2 describes how the confusion matrix can be used to correct inventory estimates to 

A 

yield unbiased estimate -. V in Section 2. let I’ be a five dimensional vector representing inven- 

As 

tory estimates for .lie five da ses. Repeating Equation 0. an unbiased inventory estimate P_ for 
these classes can be obtained as 

P. = C -1 P (II) 

where C is the confusion matrix given in Table 3. 


Table 3 

Confusion Matrix for One Dimensional Feature Extraction and Bayes Classifier 


Class 

1 


3 

4 

5 

1 

0.22 

0 

0 

0 

0.04 


0.07 

0.39 

0.20 

0.03 

0 

3 

0.04 

0.54 

0.78 

0.04 

0 

4 

0.34 

0.07 

0.02 

0.84 

0.08 

5 

0.34 

0 

0 

0.09 

0.88 

Expected 
Value of 
Inventory 

0.05 

0.14 

0.J8 

0.27 

0.26 

Estimate 





a 




An explicit form for C 1 is 


4.85 

- 0.01 

0 

002 

-0.22 

-1.02 

4.00 

-1.02 

•0.10 

0.06 

0.54 

-2.75 

1.99 

0.01 

-0.03 

-1.73 

-0.27 

0.04 

1.20 

-0.03 

•1.70 

-0.03 

0 

-1.13 

1.23 


From Table 3 we havj 


C(P) 


0.05 

0.14 

0.28 

0.27 

0.26 


( 13 ) 


Also 


FXPJ = C~* E(P) (14) 

Equations II, 12, 13 and 1 4 yield 

E(PJ = 

Equation I 5 is a logical consequence of the unbiased property of P_ as an estimator. But the results 
of this simulation serve as a useful numerical check on the validity of our development. 


0.2 

0.2 

0.2 

0.2 

0.2 


(15) 
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4. SUMMARY 


The results of this paper suggest that a decision rule used for inventory estimation should 
satisfy the following criteria 

A) computational efficiency 

B) the computability of the associated confusion matrix 

C) the invertibility of the associated confusion matrix. 

When criteria B and C are satisfied, it is shown that unbiased inventory estimates can be constructed. 
The variances of these estimates are bounded by terms which are proportional to 1/N where N is the 
total sample size. Hence, for the large sample sizes typical of satellite based remote sensing scenes 
very accurate inventory estimates should be achievable. 

The computational procedure recommended in this paper and which satisfies the above criteria 
is outlined in steps A through F as given in Section 2, with the one dimensional feature extraction 
chosen so as to maximize the trace of the associated confusion matrix. The mathematical details of 
the confusion matrix computation and the required optimization process are included in the Appen- 
dix. 


The computational algorithms needed for this approach to inventory estimation were program- 
med for a small desk top computing machine and a simulated inventory estimation was performed 
using class statistics obtained from an observed LANDSAT scene. The simulation demonstrated the 
unbiased nature of resulting inventory estimates. 
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APPENDIX 


OPTIMAL ONE DIMENSIONAL FEATURE EXTRACTION 

Assume K normal random variables 77 (t^j , Sj), i = 1 , 2, . . . K each of dimension N. Samples 
are chosen from these populations according to a probability law defined by a priori class proba- 
bilities Pj, 1 = 1 , 2, , . . K. The problem is to classify each sample by assigning it to the class from 
which it was chosen. In this case the Bayes classifier can be obtained by defining K functions, each 
with an Ndimcnsional argument 

S,(X) = (X-7?,) 1 S," 1 (X - Jjj) + Un l£ t !- 2£n P|, i = 1 , 2, . . . K (1) 

Assign a sample X to the class indexed by i if and only if 

Sj (X) < Sj (X) for all j not equal to i. (2) 

The decision rule defined by equation 2 can be shown to minimize risk under a zero-one loss matrix 
[2] . It also maximizes likelihood. Hence, by the Neyman-Pearson Letnma [3] , it minimizes the 
global probability of misclassification. 

If the number of samples to be classified is large and if the integers N and K are relatively large, 
the above defined decision rule can be expensive to implement. An alternative is to map the N di- 
mensional samples to a one dimensional space and perform the Bayes classification in one dimen- 
sion. Let V be the N dimensional vector which defines the transformation. Each N dimensional 
sample X to be classified is mapped onto a scalar x by 

X = V T X (3) 

The classification problem can be redefined in the following fashion. Assume K one dimensional 
normal random variables 77 Oq, cq), i - 1 , 2, . . . K. The required means and variances are defined as 
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Mi = V t t?|, i= 1, 2, ... k: 
of «* V T S, V,i= 1,2,...K 


(4a) 

(4b) 


The scalar X defined by equation 3 can be considered as having been chosen from one of the above 
defined populations according to a probability law defined by a priori class probabilities P ( , i = l , 

2, , . . K. Define the K functions 

f, (X) = — 2 - ! - + fin of - 2fin P, f i = 1, 2, . . . K (5) 

The scalar X, and hence the vector X, is assigned to the class indexed by i if and only if 

fj (X) < fj (X) for all j not equal to i. (6) 

For every vector V, equations 3 through 6 provide a well defined decision rule with which one 
can associate a global misclassification probability P. Our object is to obtain a V which minimizes 
P. Represent the relationship between V and P as 

P = F(V) (7) 

We require an explicit and executable algorithm for the function symbolized in equation 7. Given 
that, conventional multidimensional optimization techniques can be used to complete our task. 

For any vector V, associate a so-called confusion matrix D v defined as follows: 

D v (ij j] "*■ conditional probability that a sample chosen from (8) 

class i is assigned to class j under the decision rule 
defined by equations 3 through 6 


It follows that 

P = 1 - h Ps D v [i, i] 
1=1 


( 9 ) 
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For equation 9 to be of use, a computable expression for the elements of D v must be provided. 

For simplicity of exposition we assume that if i j* j, aj aj. It is not difficult to modify the results 
to account for duplications. 

For each j, define the set Tj as follows 

Tj = | all x such that fj (x) < f| (X), i = 1 , 2, . . . K j (10) 

Then 

D v [i,jl - / nUti,Oi)AX (11) 

T J 

It remains to provide an explicit description of the sets Tj, j ~ 1 , 2, ... K. 


For each i < K and j < K define the sets $ j as 

j3 ( j = | all X such that fj (X) < f| (X) } (1 2) 

Tlicn 

(13) 

From equation 5 and 6 it follows that the boundary points of $ j are the solution of the quadratic 
equation 

Ax 2 + BX + C = 0 (14) 

with 


A = 



B 



C 




(15a) 


(15b) 


(15c) 
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Define the boundary points of /Jy ns 


Hence, 


T, (iJ = min 


-B + V r B y -4AC -13 VB 2 -4AC 


2,i,J 


max 



| T'j.ij] if 0} < 0| 

T t,ui u n' 2 , f j,'«) if oj > a ( 


06a) 


06b) 


07) 


It will be convenient to represent the set of boundary points of an arbitrary set S by the symbolism 
b (s). From equation J 7 

b (ftj) = T|,i,j u T 2f i,j (18) 


From equations 13 and 17 it follows that the set Tj can be represented as the disjoint union of a 
finite set of intervals and that 


b(Tj) C yb (Aj) 09) 

When the dements of b (Tj) are linearly ordered according to size, left sided and right sided bound- 
ary points must alternate. Hence, to reconstruct the set Tj it is sufficient to know which elements 
ofUb tftj) are elements of b (Tj)and the classification of the smallest clement in b(Tj) as either a 
left sided or a right sided boundary point. For each element z eubtftj), z is placed in b(Tj) if and 
only if for each i < K 


T l,i,j <z<T 2.iJ»«f £7 j < a i 
7 ^ T UJ or z> T 2 ( jj, if Oj> a, 


( 20 ) 


Order the elements of b (Tj) by increasing size to create the indexed set {sj >m J . There exists an 
C < z and an i < K such that 
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s j,l c T 2*IJ 


( 21 ) 


Gassify Sjj as cither a left sided or right sided boundary point according to the following rule 


£ ° 1 and Oj < 0 | 
£ = 2 and aj > a, 


-Heft sided 


£ = 1 and 0 | < Oj 
£ = 2 and o ( > <Jj 


-► riglit sided 


( 22 ) 


In order to conveniently compute the integral on the riglit side of equation 1 1 , transform the indexed 
set {sj >m | into the indexed set {rj (in } by the mapping 

r j,m “(Sj.m -P|)M (23) 

Let Q be the index of the largest element in |rj,m} and let d(x) represent the distribution function 
of the standard nornr i random variable. Then D v [ij J can be computed as follows when Sj (1 is a 
right sided boundary .mint 

D v [i>ii = I ~d(r jjQ ) + d(r jtl )-h Q £ d(r. 8+1 ) - d(r jt£ ) (24) 

and when Sj.j is a left sided boundary point 

Dyli.j) = Q S d(r JtS+1 )-d(n*) (25) 

fi=i 

Equations 8 through 25 give explicit expression to the functional relationship symbolized by 
equation 7. The vector V which minimizes P can be estimated by a conventional multidimensional 
Newton-Raphson h oration procedure [4] . Tills procedure requires all first and second order deriv- 
atives of P with respect to the elements of V. These derivatives can be obtained analytically by 
implicit differentiation and the application of the chain rule on equations 9, 15, 16, 23, 24, and 25. 

A sufficiently accurate first guess is also required. An adequate first guess can be obtained as follows. 
The Fisher criterion [5] for class separability between classes indexed by i and j is given by 
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f|j = (M| -Mj) 2 /(0| 2 +Oj 2 ) 


(2o> 


The feature extraction procedure which maximizes Ijj is defined by the vector V, , which is 


V ld =(>/,£, + '/ i 2; J r 1 (r? i hj) 


(27) 


Tiie first guess V 0 is obtained as the average V i(j over all pairs (i, j), i < K, j < i. Hence 


2 < K. — 2)1 x H 

V = — v v v 

° K! 1^2 1^1 i,J 


(28) 


For some applications, the correct classification of samples of certain types may be more im- 
portant than correct classification of samples of other types. Hence, it is worthwhile to introduce 
a simple generalization to the feature extraction procedure defined by equations 3 through 28. Let 
G be a K dimensional vector consisting of zeroes and ones. Instead of choosing a V to minimize the 
global probability of misclassification as defined by equation 9, choose a V which minimizes 


Pc 


K 

v 

i“l 


G(i) P, (! 


D v M. jl) 


(29) 


The symbol represents the probability of misclassification of samples chosen from all classes 
indexed Ly an integer i such tiiat G(i) = 1 . The generalization is obtained by substituting equation 
29 for equation 9. The rest of the development given by equations 10 through 28 is left undisturbed. 
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